METHODS FOR ANALYZING GENE EXPRESSION PATTERNS 



RELATED APPLICATIONS 
[0001] This application claims priority under 35 U.S.C. § 119(e) to USSN 

60/245,081, filed November 1, 2000, which is incorporated by reference in their entirety 
for all purposes. 

BACKGROUND OF THE INVENTION 
[0002] The present invention generally relates to systems and methods for 

facilitating the identification of disease associated genes. In particular, the invention 
relates to improved techniques for analyzing gene expression patterns to discover disease 
associated genes. The invention also relates to three novel cancer-associated genes 
identified by the method and their corresponding polypeptides and to the use of these 
biomolecules in diagnosis, prognosis, treatment, prevention, and evaluation of therapies 
for diseases, particularly diseases associated with cell proliferation, such as cancer. 
[0003] The DNA sequences of many human genes have been determined, but for 

many of these genes, their biological function, and in particular their relationship to 
disease, is unknown or poorly understood. Current laboratory and computational 
methods to determine new methods that provide additional information on function are 
desirable. 

[0004] The recent development of complementary DNA micro-array technology 

provides a powerful analytical tool for human genetic research (M. Schena, D. Shalon, R. 
W. Davis, and P. O. Brown, "Quantitative monitoring of gene expression patterns with a 
complementary DNA microarray," Science, 270(5235), 467-70, 1995). One of its basic 
applications is to quantitatively analyze fluorescence signals that represent the relative 
abundance of mRNA from two distinct tissue samples. cDNA micro-arrays are prepared 
by automatically printing thousands of cDNAs in an array format on glass microscope 
slides, which provide gene-specific hybridization targets. Two different samples (of 
mRNA) can be labeled with different fluors and then co-hybridized on to each arrayed 
gene. Ratios of gene-expression levels between the samples are calculated and used to 
detect meaningfully different expression levels between the samples for a given gene. 
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Such monitoring technologies have been applied to the identification of genes which are 
up regulated or down regulated in various diseased or physiological states, the analyses 
of members of signaling cellular states, and the identification of targets for various drugs. 
[0005] The various characteristics of this analytic scheme make it particularly 

useful for directly comparing the abundance of mRNAs present in two cell types. Visual 
inspection of such a comparison is sufficient to find genes where there is a very large 
differential rate of expression. 

[0006] Walker et al. (1999) Genome Research 91:1 198-1203 discusses a method 

for identifying genes associated with disease wherein the expression of genes in multiple 
cDNA libraries was examined. The method described therein allows one to perform a 
coexpression analysis on clone count data from sequencing. The statistical analysis is 
performed using a categorical method (i.e., present or absent in clone count data from a 
library) rather than analyzing expression as a continuous variable using linear or rank 
correlation. 

[0007] For single channel microarray data, one could conceivably define a 

threshold of detection and use the same categories as described in Walker. However, 
because of the high sensitivity level of the microarrays, most genes would be classified 
as present. In addition, the threshold to use is uncertain and the results would be 
sensitive to this. These factors would increase the effective information loss resulting 
from converting real valued data to categories. Thus, typically Pearson's or Spearman's 
correlational methods are used for the analysis of single channel microarray data. 
[0008] As with single channel, it is also not practical to categorize data for dual 

channel microarray data as present or absent. In addition, each channel of dual channel 
technology is not absolute; thus, further increasing the difficulty in defining the 
threshold. Moreover, the categories of absent or present are not appropriate when 
applied to channel ratios. 

[0009] A more thorough study of the changes in expression requires the ability to 

discern more subtle changes in expression level and the ability to determine whether 
observed differences are the result of random variation or whether they are likely to be 
meaningful changes. As such, there continues to be interest in the development of new 
methodologies of gene expression analysis, particularly for methodologies applicable to 
dual channel microarray technology. 
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SUMMARY OF THE INVENTION 
[0010] In one aspect, the present invention provides a method for identifying 

biomolecules, such as polynucleotides or polypeptides, useful in the diagnosis, 
prognosis, treatment, prevention, and evaluation of therapies for diseases. The method 
can also be employed for elucidating genes involved in a common regulatory pathway. 
[0011] The method comprises first characterizing expression patterns of 

polynucleotides and more particularly, mRNAs. The expressed polynucleotides 
comprise genes of known and unknown functions. The expression patterns can be 
obtained through the analysis of a plurality of dual channel microarray data or through 
£ the construction of dual channel data from single channel data and analysis of the 

5 resulting "synthetic" dual channel data. Second, the expression patterns of one or more 

5 function-specific genes are compared with the expression patterns of one or more of the 

El genes of unknown function to identify a subset of novel genes which have similar 

CO expression patterns to those of the function-specific genes. 

L [0012] The method compares the expression pattern of two genes by first 

generating an expression data vector for each gene. The vector comprises entries for 
P each gene wherein a differentially expressed gene is represented by a one and a non- 

differentially expressed gene by a zero. The vectors are then analyzed to determine 
whether the expression patterns of any of the genes are similar. Expression patterns are 
similar if a particular probability threshold is met. Preferably, the probability threshold 
is less than 10" 7 , and more preferably less than 10" 9 . 

[0013] In a preferred embodiment, the function-specific genes are 

disease-specific gene sequences including TNF-inducible chemokines, including human 
tumor necrosis factor alpha inducible protein A20; human cytokine (GRO-beta) mRNA; 
human IL-8; human GRO (growth regulated) gene; and human mRNA for GRS protein. 
Other disease-specific gene sequences include those involved with cancer of the 
digestive tract and/or colon, such as those listed in Table 4. These groups of 
disease-specific genes are used to identify other polynucleotides of unidentified function 
that are predominantly coexpressed with the disease-specific genes. The polynucleotides 
analyzed by the present invention can be expressed sequence tags (ESTs), assembled 
sequences, full length gene coding sequences, introns, regulatory regions, 5' untranslated 
regions, 3' untranslated regions and the like. 
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[0014] In a second aspect, the invention entails a substantially purified 

polynucleotide identified by the method of the present invention as being associated with 
cancer. In particular, the polynucleotide comprises a sequence selected from the group 
consisting of SEQ ID NOs:7, 13, or 17 or its complement or a variant having at least 
70% sequence identity to SEQ ID NOs: 7, 13, or 17 or a polynucleotide that hybridizes 
under stringent conditions to SEQ ID NOs: 7, 13, or 17 or a polynucleotide encoding 
SEQ ID NOs: 8, 14, or 18. The present invention also entails a polynucleotide 
comprising at least 18 consecutive nucleotides of a sequence provided above. The 
polynucleotide is suitable for use in diagnosis, treatment, prognosis, or prevention of a 
cancer. The polynucleotide is also suitable for the evaluation of therapies for cancer. 
[0015] In another aspect, the invention provides an expression vector comprising 

a polynucleotide described above, a host cell comprising the expression vector, and a 
method for detecting a target polynucleotide in a sample. 

[0016] In a further aspect, the invention provides a substantially purified 

polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:8, SEQ ID NO:14, and SEQ ID NO:16. The invention also provides a 
substantially purified polypeptide having at least 85% identity to SEQ ID NOs:8, 14, or 
18. Additionally, the invention also provides a sequence with at least 6 sequential amino 
acids of SEQ ID NOs:8, 14, or 18. 

[0017] The invention also provides a method for producing a substantially 

purified polypeptide comprising the amino acid sequence referred to above, and 
antibodies, agonists, and antagonists which specifically bind to the polypeptide. 
Pharmaceutical compositions comprising the polynucleotides or polypeptides of the 
invention are also contemplated. Methods for producing a polypeptide of the invention 
and methods for detecting a target polynucleotide complementary to a polynucleotide of 
the invention are also included. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0018] The accompanying drawings, which are incorporated in and form a part of 

this specification, illustrate embodiments of the invention and, together with the 
description, serve to explain the principles of the invention. 

[0019] Figure 1 shows a high level process flow for identifying novel genes that 

exhibit a statistically significant co-differential expression pattern with a target gene. 
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[0020] Figure 2 is a block diagram of a computer system that may be used to 

implement various aspects of this invention such as the algorithms for comparing 
expression patterns. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 
[0021] The Sequence Listing, which is incorporated herein by reference in its 

entirety, provides exemplary disease-associated sequences including polynucleotide 
sequences, SEQ ID NOs: 7, 13, or 17, and polypeptide sequences, SEQ ID NOs: 8, 14, or 
18. Each sequence is identified by a sequence identification number (SEQ ID NO) 
and/or by the Incyte Clone number from which the sequence was first identified. 

DETAILED DESCRIPTION OF EXAMPLE EMBODIMENTS 
[0022] Reference will now be made in detail to the preferred embodiments of the 

invention. While the invention will be described in conjunction with preferred 
embodiments, it should be understood that such embodiments are not intended to limit 
the invention to these embodiments. On the contrary, the invention is intended to cover 
alternatives, modifications and equivalents which are included within the spirit and scope 
of the invention. For example, the invention will be described by referring to 
embodiments providing methods, compositions, data analysis systems and computer 
program products for discovering functional regions in a genome. However, the 
methods, compositions, computational analysis and computer program products may be 
useful for analyzing the sequences of other biological molecules, particularly those useful 
for comparing sequences when one sequence is known and the other is not. 
[0023] As used herein the specification, "a" or "an" may mean one or more. As 

used herein in the claim(s), when used in conjunction with the word "comprising", the 
words "a" or "an" may mean one or more than one. As used herein "another" may mean 
at least a second or more. 

[0024] One skilled in the art recognizes that when first substrate and second 

substrate are referenced herein that both the first and second substrates could be different 
substrates or that a single substrate is used in both cases. In the later case, after use of the 
substrate as the first substrate, the conditions on the substrate are changed such that the 
sequences hybridized on the first use are removed and the substrate is then used as the 
second substrate. 
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[0025] All patents and publications mentioned in the specification are indicative 

of the level of those skilled in the art to which the invention pertains. All patents and 
publications are herein incorporated by reference to the same extent as if each individual 
publication was specifically and individually indicated to be incorporated by reference. 

DEFINITIONS 

[0026] "NSEQ" refers generally to a polynucleotide sequence of the present 

invention, including SEQ ID NOs: 7, 13, and 17. "PSEQ" refers generally to a 
polypeptide sequence of the present invention, including SEQ ID NOs: 8, 14, and 18. 
C [0027] A "variant" refers to either a polynucleotide or a polypeptide whose 

O sequence diverges from SEQ ID NOs: 7, 13, or 17 or SEQ ID NOs: 8, 14, or 18, 

5 respectively. Polynucleotide sequence divergence may result from mutational changes 

| such as deletions, additions, and substitutions of one or more nucleotides; it may also 

S occur because of differences in codon usage. Each of these types of changes may occur 

U alone, or in combination, one or more times in a given sequence. Polypeptide variants 

Si include sequences that possess at least one structural or functional characteristic of SEQ 

H ID NOs: 8, 14, or 18. 

P [002 8] "Gene" or "gene sequence" refers to the partial or complete coding 

sequence of a gene. The term also refers to 5' or 3' untranslated regions. The gene may 
be in a sense or antisense (complementary) orientation. 

[0029] "Disease-specific gene" refers to a gene sequence which has been 

previously identified as useful in the diagnosis, treatment, prognosis, or prevention of a 
disease, and more preferably, in the diagnosis, treatment, prognosis, or prevention of 
cancer. 

[0030] "Disease-associated gene" refers to a gene sequence whose expression 

pattern is similar to that of the disease-specific genes and which are useful in the 
diagnosis, treatment, prognosis, or prevention of disease. The gene sequences can also 
be used in the evaluation of therapies for disease. 

[0031] "Substantially purified" refers to a nucleic acid or an amino acid 

sequence that is removed from its natural environment and is isolated or separated, and is 
at least about 60% free, preferably about 75% free, and most preferably about 90% free 
from other components with which it is naturally present. 



Page 6 



THE INVENTION 

[00321 The present invention encompasses a method for identifying biomolecules 

that are associated with a specific disease, regulatory pathway, subcellular compartment, 
cell type, tissue type, or species. In particular, the method identifies gene sequences 
useful in diagnosis, prognosis, treatment, prevention, and evaluation of therapies for 
various diseases. 

[0033] The method entails first identifying polynucleotides (or mRNAs) that are 

expressed in a biological system of interest. The polynucleotides include genes of 
known function, genes known to be specifically expressed in a specific disease process, 
subcellular compartment, cell type, tissue type, or species. Additionally, the 
polynucleotides include genes of unknown function. The expression patterns of the 
known genes are then compared with those of the genes of unknown function to 
| determine whether a specified probability threshold is met. Through this comparison, a 

subset of the polynucleotides having a high probability of being co-differentially 
H expressed with the known genes can be identified. The high probability correlates with a 

£ particular probability threshold which is less than 10" 7 , and more preferably less than 10 9 . 
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THE MICRO ARRAYS 
[0034] The polynucleotides that are deposited as targets on the microarrays 

originate from cDNA libraries derived from a variety of sources including, but not 
limited to, eukaryotes such as human, mouse, rat, dog, monkey, plant, and yeast and 
prokaryotes such as bacteria and viruses. These polynucleotides can also be selected 
from a variety of sequence types including, but not limited to, expressed sequence tags 
(ESTs), assembled polynucleotide sequences, full length gene coding regions, introns, 
regulatory sequences, 5' untranslated regions, and 3 1 untranslated regions. 
[0035] The microarrays comprise polynucleotides from cDNA libraries obtained 

from blood vessels, heart, blood cells, cultured cells, connective tissue, epithelium, islets 
of Langerhans, neurons, phagocytes, biliary tract, esophagus, gastrointestinal system, 
liver, pancreas, fetus, placenta, chromaffin system, endocrine glands, ovary, uterus, 
penis, prostate, seminal vesicles, testis, bone marrow, immune system, cartilage, 
muscles, skeleton, central nervous system, ganglia, neuroglia, neurosecretory system, 
peripheral nervous system, bronchus, larynx, lung, nose, pleurus, ear, eye, mouth, 
pharynx, exocrine glands, bladder, kidney, ureter, and the like. 
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[0036] In a preferred embodiment, gene sequences are assembled to reflect 

related sequences, such as assembled sequence fragments derived from a single 
transcript. Assembly of the polynucleotide sequences can be performed using sequences 
of various types including, but not limited to, ESTs, extensions, or shotgun sequences. 
In a most preferred embodiment, the polynucleotide sequences are derived from human 
sequences that have been assembled using the algorithm disclosed in "Database and 
System for Storing, Comparing and Displaying Related Biomolecular Sequence 
Information", Lincoln et al., Serial No:60/079,469, filed March 26, 1998, herem 
incorporated by reference. 

EVALUATION OF DIFFERENTIAL EXPRESSION 
[0037] Experimentally, differential expression of the polynucleotides can be 

evaluated by methods including, but not limited to, differential display by spatial 
immobilization or by gel electrophoresis, genome mismatch scanning, representational 
difference analysis, and transcript imaging. Additionally, differential expression can be 
assessed by microarray technology. These methods may be used alone or in combination. 
[0038] Preferably, a microarray is created by arraying individual polynucleotides 

on a substrate with each gene occupying a unique location. Differential expression is 
assessed by dual channel microarray technology. More specifically, samples of mRNA 
from treated cells are purified, fluorescently labeled, and competitively hybridized 
against an untreated reference sample labeled with a different fluorochrome. After 
hybridization and washing, the microarrays are scanned for the two different fluorescent 
labels. 

[0039] Image-processing algorithms calculate the signal generated from each 

fluorescent probe on each element. More specifically, it has been found that the ratio of 
the two fluorescent intensities provides a highly accurate and quantitative measurement 
of the relative gene expression level in the two cell samples. For example, if a 
microarray element shows no fluorescence, it indicates that the gene in that element was 
not expressed in either cell sample. If an element shows a single color, it indicates that a 
labeled gene was expressed only in that cell sample. The appearance of both colors 
indicates that the gene was expressed in both cell samples. Even genes expressed once 
per cell (1 part in 100,000 sensitivity) can be detected using this technology. Two-fold 
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changes of expression intensity are also readily detectable. Expression rauos ean 
be ealcalated for those elements with sufficient signal in at least one channel. 
,0040. The number of microarray images used in the analyses ean range font as 

few aa 20 to greater than 10,000. Preferably, the number of me dual channel microarray 
images used in the analyses described herein for estimating the probability that two 
polynucleotides are co-differentially expressed is greater than 200. 

STATISTICAL ANALYSIS OF CO-DIFFERENTIAL EXPRESSION 
,0041] A high level process flow 101 in accordance with one embodiment of this 

invention for identifying novel genes that exhibit a statisttcafly significant co-differenftal 
expression pattern with a targe, gene is depicted in Figure 1. The process begins a, 103 
with the dual channel microarray data. The data can be obtained directly usmg dual 
channel technology as described above or synthetic dual channel data ean be created by 
obtaining single channel data and taxing ratios between different microarray 
experiment, At 105, each gene sequence is then classified as either being drfferenttally 
expressed or as no. being differentially expressed. This determination may requrre a 
properly selected threshold for differential expression. In practice, a useful selechon of 
this threshold can be done empirically using techniques known in tire art and rs done 
commonly. See, e.g., US Paten. No. 6,245,517, which is incorporated herem by 
reference. Once tire microarray data has been classified into .he tnufuatly exclude 
categories of differentially expressed and no. differentially expressed, statistical analysrs 
can be performed to determine whether two genes are co-differentially expressed. 
,00421 To detennine whether two genes, A and B, have similar differenfial 

expression patterns, a. .07, expression data veCors ean be generated as illustrated m 
Table 1 wherein a differentially expressed gene is indicated by a one and a non- 
different'ially expressed gene by a zero. In ofher words, a "one" indicates ma, a gene , a 
differentially expressed a, a ratio ma, is greater than me threshold (e.g., +/- 2 fold) and . 
W indicates that a gene is no. dffferentialty expressed (e.g., shows less than a +/- 
fold change in expression between treated and untreated samples). 
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Table 1 . Expression data vectors for genes A and B 



Microarray 
Hybridization 1 



icroarray 
Hybridization 2 



Microarray 
Hybridization 3 



Microarray 
Hybridization 

N 

0 " 
0 



gene A 



1 



gene B 



[0043] For a given pair 

2x2 contingency table. 



of genes, the expression data vectors are summarized in a 



Table 2. Contingency table for co-differential expression of genes A 



andB 





Gene A 
2-fold +/- 


Gene A 
No change 


Total 


GeneB 
2-fold +/- 


8 


2 


10 


GeneB 
No change 


2 


18 


20 


Total 


10 


20 


30 



[0044] Table 2 presents co-differential expression data for gene A and gene B in 

a total of 30 libraries. Table 2 summarizes and presents 1) the number of times gene A 
and B both display a 2-fold increase or decrease, 2) the number of times gene A and B 
both show no change in expression; 3) the number of times gene A shows a 2-fold 
increase or decrease in expression while gene B shows no change, and 4) the number of 
times gene B shows a 2-fold increase or decrease in expression while gene A shows no 
change. The upper left entry is the number of times the two genes are differentially 
expressed, and the middle right entry is the number of times neither gene is differentially 
expressed. The off diagonal entries are the number of times one gene is differentmlly 
expressed while the other does not. 

[0045] The vectors are then analyzed at 109 to determine whether the expression 

patterns of any of the genes are similar. Expression patterns are similar if a particular 
probability threshold is met. The significance of gene co-differential expression is 
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evaluated using a probability method to measure a due-to-chance probability of the co- 
differential expression. The probability method can be the Fisher exact test, the 
chi-squared test, or the kappa test. These tests and examples of their applications are 
well known in the art and can be found in standard statistics texts (Agresti, A. (1990) 
Categorical Data Analysis. New York, NY, Wiley; Rice, J. A. (1988) Mathematical 
Statistics and Data Analysis. Pacific Grove, CA, Wadsworth & Brooks/Cole). A 
Bonferroni correction (Rice, supra, page 384) can also be applied in combination with 
one of the probability methods for correcting statistical results of one gene versus 
multiple other genes. 

[0046] This method of estimating the probability for co-differential expression of 

two genes makes several assumptions. The method assumes that the libraries are 
independent and are identically sampled. However, in practical situations, the selected 
cDNA libraries are not entirely independent because more than one library may be 
obtained from a single patient or tissue, and they are not entirely identically sampled 
because different numbers of cDNA's may be sequenced from each library (typically 
ranging from 5,000 to 10,000 cDNA's per library). In addition, because a Fisher exact 
probability is calculated for each gene versus 41,419 other genes, a Bonferroni correction 
for multiple statistical tests is necessary. 

[0047] The probability ("p-value") that the simultaneous 2-fold change in 

expression for gene A and gene B occurs due to chance as calculated using a Fisher exact 
test is 0.0003. In a preferred embodiment, the due-to-chance probability is measured by 
a Fisher exact test, and the threshold of the due-to-chance probability is set to less than 
10" 7 , more preferably less than 10" 9 . 

EXAMPLES 

[0048] Using the method of the present invention, five genes have been identified 

that exhibit strong association, or co-differential expression, with a known gene, human 
tumor necrosis factor alpha inducible protein A20. The results presented in Table 3 show 
that the expression of five genes, one of which is novel, have direct or indirect 
association with the expression of A20. Therefore, this novel gene can be used in the 
diagnosis, treatment, prognosis, or prevention of cancer, or in the evaluation of therapies 
for cancer. Further, the gene product of the novel gene is a potential therapeutic protein 
and target of anti-cancer therapeutics. 
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Table 3. Co-differential Expression Analysis with Protein A20 



|4 

D 

a 
a 

UJ 
01 
D 
00 

D 
'¥■■■ 
D 



P-value < 


3enbank 
Identifier 


Description 




2.9e-120 


G177865 


Human tumor necrosis factor 
alpha inducible protein A20 
(SEQIDNOs:l and 2) 


Blocks TNF-induced apoptosis. 

TnHiip^d hv TNF 

Inhibitor of NF-kappaB. 


3.0e-37 


G183628 


rluman cytokine (UKU-Deiaj 
mRNA (SEQ ID NOs:3 and 4) 


^"hpmntflrtir for neutornhilic 
granulocytes. Binds IL-8R. 
Induced bv TNF 


6.4e-36 


G179579 


Human IL-8 (SEQ ID NOs:5 
and 6) 


Activates neutrophil 
orarnilnrvtes Induced bv TNF. 


1.9e-34 


Not applicable 


SEQ ID NO:7 and SEQ ID 
NO:8 


TNF-inducible chemokine. 


4.9e-34 


Gl 83622 


Human GRO (growth regulated) 
gene (SEQ ID NOs:9 and 10) 


Neutrophil chemoattractant. 
Binds IL-8R. Induced by TNF. 


4.3e-25 


G1694788 


Human mRNA for GRS protein 
(SEQIDNOs:ll and 12) 


Blocks apoptosis by TNF, p53. 
Induced by TNF. 



[0049] Therefore, in one embodiment, the present invention encompasses a 

polynucleotide sequence comprising the sequence of SEQ ID NO:7. This polynucleotide 
has been shown by the method of the present invention to have strong association (or 
high probability for being co-differentially expressed) with a variety of TNF-inducible 
chemokines. The invention also encompasses a variant of the polynucleotide sequence 
and its complement. Variant polynucleotide sequences typically have at least about 70%, 
more preferably at least about 85%, and most preferably at least about 95% 
polynucleotide sequence identity to SEQ ID NO:7. 

[0050] Using the method of the present invention, eight genes that exhibit strong 

association, or co-differential expression, with a novel gene, SEQ ID NO: 13, have been 
identified. The results presented in Table 4 show that the expression of eight genes, one 
of which is novel, have direct or indirect association with the SEQ ID NO: 13. 
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Table 4. Co-differential Expression Analysis with Novel Gene SEQ ID NO: 13 



M 
D 
D 
D 

yi 
D 
Si 

N' 
D 
\+ 

O 



P-value 


Genbank Identifier 


Description 


3.5e-32 


-V T A 1 ' 11 

Not applicable 


oiiv *U JNJJS.i J ana 


3.2e-16 


G5726288 


Human calcim-activaated chloride channel (SEQ ID 
NUs:1j ana loj 


2.5e-ll 


Not applicable 


SEQ ID NOs: 17 and 18 


5.7e-ll 


G291963 


Human colon mucosa- associaiea v^rv^v 1AJJVi ^^ v JijV < 
IDNOs:19and 20) 


5.7e-ll 


G183414 


Human guanylin mKiN/\, compieie cas>. v^^v 
NOs:21 and 22) 


1.2e-10 


Gl 79792 


Human carbonic annyarase i ^w\±j 1J - 7 i>v^2>.^-> 
and 24) 


1.6e-10 


G409457 


Human calcium-aepenaent cnionuc i/iidiuici ^o^v 1XJ 
NOs:25 and 26) 


1.6e-10 


G4753765 


Human mRNA for UDP-glucuronosyltransferase 
(UGT) (SEQ ID NOs:27 and 28) 


4.4e-10 


G2385453 


Human mRNA for galectin-4 (SEQ ID NOs:29 and 30) 



[0051] Inspection of these results reveals that the majority of the genes are 

digestive tract / colon specific. In addition, three of the genes are associated with 
adenocarcinoma, including DRA or "Down Regulated in Adenoma". Chloride channel 
genes have also been associated with colon cancer, although these changes may be a side 
effect of the cancer rather than a mechanism of the cancer. It has also been shown that 
uroguanylin treatment suppresses polyp formation and induces apoptosis in human colon 
adenocarcinoma cells. As such, the analysis indicates that SEQ ID NO: 13 and SEQ ID 
NO: 17 may be involved with cancer of the digestive tract and/or colon. Therefore, these 
two novel genes can potentially be used in diagnosis, treatment, prognosis, or prevention 
of cancer, or in the evaluation of therapies for cancer. Further, the gene products of these 
two genes are potential therapeutic proteins and targets of anti-cancer therapeutics. 
[0052] Therefore, in one embodiment, the present invention encompasses a 

polynucleotide sequence comprising the sequence of SEQ ID NO: 13 or SEQ ID NO: 17. 
The invention also encompasses a variant of the polynucleotide sequence and its 
complement. Variant polynucleotide sequences typically have at least about 70%, more 
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preferably at least about 85%, and most preferably at least about 95% polynucleotide 
sequence identity to SEQ ID NO:13 or SEQ ID NO:17. 

[0053] One preferred method for identifying variants entails using NSEQ and/or 

PSEQ sequences to search against the GenBank primate (pri), rodent (rod), and 
mammalian (mam), vertebrate (vrtp), and eukaryote (eukp) databases, SwissProt, 
BLOCKS (Bairoch, A. et al. (1997) Nucleic Acids Res. 25:217-221), PFAM, and other 
databases that contain previously identified and annotated motifs, sequences, and gene 
functions. Methods that search for primary sequence patterns with secondary structure 
gap penalties (Smith, T. et al. (1992) Protein Engineering 5:35-51) as well as algorithms 
such as BLAST (Basic Local Alignment Search Tool; Altschul, S.F. (1993) J. Mol. Evol 
36:290-300; and Altschul et al. (1990) J. Mol. Biol. 215:403-410), BLOCKS (Henikoff 
S. and Henikoff G.J. (1991) Nucleic Acids Research 19:6565-6572), Hidden Markov 
Models (HMM; Eddy, S.R. (1996) Cur. Opin. Str. Biol. 6:361-365; and Sonnhammer, 
E.L.L. et al. (1997) Proteins 28:405-420), and the like, can be used to manipulate and 
analyze nucleotide and amino acid sequences. These databases, algorithms and other 
methods are well known in the art and are described in Ausubel, F.M. et al. (1997; Short 
Protocols in Molecular Biology, John Wiley & Sons, New York , NY )and in Meyers, 
R.A. (1995; Molecular Biology and Biotechnology, Wiley VCH, Inc, New York, NY, p 
856-853). 

[0054] Also encompassed by the invention are polynucleotide sequences that are 

capable of hybridizing to SEQ ID NO: 7, SEQ ID NO:13, and SEQ ID NO:17, and 
fragments thereof under stringent conditions. Stringent conditions can be denned by salt 
concentration, temperature, and other chemicals and conditions well known in the art. In 
particular, stringency can be increased by reducing the concentration of salt, or raising 
the hybridization temperature. 

[0055] For example, stringent salt concentration will ordinarily be less than about 

750 mM NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl 
and 50 mM trisodium citrate, and most preferably less than about 250 mM NaCl and 25 
mM trisodium citrate. Stringent temperature conditions will ordinarily include 
temperatures of at least about 30@C, more preferably of at least about 37@C, and most 
preferably of at least about 42@C. Varying additional parameters, such as hybridization 
time, the concentration of detergent (sodium dodecyl sulfate, SDS) or solvent 
(formamide), and the inclusion or exclusion of carrier DNA, are well known to those 
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skilled in the art. Additional variations on these conditions will be readily apparent to 
those skilled in the art (Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 
152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511; Ausubel, F.M. et 
al. (1997) Short Protocols in Molecular Biology, John Wiley & Sons, New York, NY; 
and Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring 
Harbor Press, Plainview, NY). 

[0056] NSEQ or the polynucleotide sequences encoding PSEQ can be extended 

utilizing a partial nucleotide sequence and employing various PCR-based methods 
known in the art to detect upstream sequences, such as promoters and regulatory 
elements. (See, e.g., Dieffenbach, C.W. and G.S. Dveksler (1995; PCR Primer, a 
Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, pp.1-5; Sarkar, G. (1993; 
PCR Methods Applic. 2:318-322); Triglia, T. et al. (1988; Nucleic Acids Res. 16:8186); 
Lagerstrom, M. et al. (1991; PCR Methods Applic. 1:111-119); and Parker, J.D. et al. 
(1991; Nucleic Acids Res. 19:3055-306). Additionally, one may use PCR, nested 
primers, and PROMOTERFINDER libraries to walk genomic DNA (Clontech, Palo 
Alto, CA). This procedure avoids the need to screen libraries and is useful in finding 
intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software 
(National Biosciences Inc., Plymouth MN) or another appropriate program, to be about 
18 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal 
to the template at temperatures of about 68@C to 72@C. 

[0057] In another aspect of the invention, NSEQ or the polynucleotide sequences 

encoding PSEQ can be cloned in recombinant DNA molecules that direct expression of 
PSEQ or the polypeptides encoded by NSEQ, or structural or functional fragments 
thereof, in appropriate host cells. Due to the inherent degeneracy of the genetic code, 
other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express the polypeptides of PSEQ or 
the polypeptides encoded by NSEQ. The nucleotide sequences of the present invention 
can be engineered using methods generally known in the art in order to alter the nucletide 
sequences for a variety of purposes including, but not limited to, modification of the 
cloning, processing, and/or expression of the gene product. DNA shuffling by random 
fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides 
may be used to engineer the nucleotide sequences. For example, 
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oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations 
that create new restriction sites, alter glycosylation patterns, change codon preference, 
produce splice variants, and so forth. 

[0058] In order to express a biologically active polypeptide encoded by NSEQ, 

NSEQ or the polynucleotide sequences encoding PSEQ, or derivatives thereof, may be 
inserted into an appropriate expression vector, i.e., a vector which contains the necessary 
elements for transcriptional and translation^ control of the inserted coding sequence in a 
suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and 
in NSEQ or polynucleotide sequences encoding PSEQ. Methods which are well known 
to those skilled in the art may be used to construct expression vectors containing NSEQ 
or polynucleotide sequences encoding PSEQ and appropriate transcriptional and 
translations control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., 
Sambrook (supra) and Ausubel, (supra ). 

[0059] A variety of expression vector/host cell systems may be utilized to contain 

and express NSEQ or polynucleotide sequences encoding PSEQ. These include, but are 
not limited to, microorganisms such as bacteria transformed with recombinant 
bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast 
expression vectors; insect cell systems infected with viral expression vectors 
(baculovirus); plant cell systems transformed with viral expression vectors, cauliflower 
mosaic virus (CaMV) or tobacco mosaic virus (TMV), or with bacterial expression 
vectors (Ti or pBR322 plasmids); or animal cell systems. The invention is not limited by 
the host cell employed. For long term production of recombinant proteins in mammalian 
systems, stable expression of a polypeptide encoded by NSEQ in cell lines is preferred. 
For example, NSEQ or sequences encoding PSEQ can be transformed into cell lines 
using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a 
separate vector. 

[0060] In general, host cells that contain NSEQ and that express PSEQ may be 

identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include 
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membrane, solution, or chip based technologies for the detection and/or quantification of 
nucleic acid or protein sequences. Immunological methods for detecting and measunng 
the expression of PSEQ using either specific polyclonal or monoclonal antibodies are 
known in the art. Examples of such techniques include enzyme-linked immunosorbent 
assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting 
(FACS). 

[0061] Host cells transformed with NSEQ or polynucleotide sequences encoding 

PSEQ may be cultured under conditions suitable for the expression and recovery of the 
protein from cell culture. The protein produced by a transformed cell may be secreted or 
retained intracellular* depending on the sequence and/or the vector used. As will be 
understood by those of skill in the art, expression vectors containing polynucleotides of 
NSEQ or polynucleotides encoding PSEQ may be designed to contain signal sequences 
which direct secretion of PSEQ or polypeptides encoded by NSEQ through a prokaryotic 
or eukaryotic cell membrane. 

[0062] " In addition, a host cell strain may be chosen for its ability to modulate 
expression of the inserted sequences or to process the expressed protein in the desired 
fashion Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, ligation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to specify protein targeting, folding, and/or activity. Different host cells which have 
specific cellular machinery and characteristic mechanisms for post-translational activities 
(e g CHO HeLa, MDCK, HEK293, and WI38), are available from the American Type 
Culture Collection (ATCC, Bethesda, MD) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

[0063] In another embodiment of the invention, natural, modified, or 

recombinant NSEQ or nucleic acid sequences encoding PSEQ are ligated to a 
heterologous sequence resulting in translation of a fusion protein containing heterologous 
protein moieties in any of the aforementioned host systems. Such heterologous protein 
moieties facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 
6-His, FLAG, c-myc, hemagglutinin (HA) and monoclonal antibody epitopes.. 
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[0064] In another embodiment, NSEQ or sequences encoding PSEQ are 

synthesized, in whole or in part, using chemical methods well known in the art. (See, 
e g Caruthers, M.H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223; Horn, T. et al. 
(1980) Nucl. Acids Res. Symp. Ser. 225-232; and Ausubel, supra). Alternatively, PSEQ 
or a polypeptide sequence encoded by NSEQ itself, or a fragment thereof, may be 
synthesized using chemical methods. For example, peptide synthesis can be performed 
using various solid-phase techniques (Roberge, J.Y. et al. (1995) Science 269:202-204). 
Automated synthesis may be achieved using the ABI 431A Peptide Synthesizer (Perkm 
Elmer) Additionally, PSEQ or the amino acid sequence encoded by NSEQ, or any part 
thereof, may be altered during direct synthesis and/or combined with sequences from 
other proteins, or any part thereof, to produce a polypeptide variant. 
[0065] In another embodiment, the invention entails a substantially purified 

polypeptide comprising the amino acid sequence selected from the group consisting of 
SEQ ID NO:8, SEQ ID NO:14, SEQ ID NO:18, or fragments thereof. SEQ ID NO:8 is 
encoded by SEQ ID NO:7 and is a potential TNF-inducible chemokine. SEQ ID NO:18 
and SEQ ID NO:14 are encoded by SEQ ID NO:17 and SEQ ID NO:13, respectively and 
may be involved with cancer of the digestive tract and/or colon. 

DIAGNOSTICS and THERAPEUTICS 
[0066] The sequences of the these genes can be used in diagnosis, prognosis, 

treatment, prevention, and evaluation of therapies for diseases associated with cell 
proliferation, particularly cancer. Further, the amino acid sequences encoded by the 
novel genes are potential therapeutic proteins and targets of anti-cancer therapeutics. 
[0067] In one preferred embodiment, the polynucleotide sequences of NSEQ or 

the polynucleotides encoding PSEQ are used for diagnostic purposes to determine the 
absence, presence, and excess expression of PSEQ, and to monitor regulation of the 
levels of mRNA or the polypeptides encoded by NSEQ during therapeutic intervention. 
The polynucleotides may be at least 18 nucleotides long, complementary RNA and DNA 
molecules, branched nucleic acids, and peptide nucleic acids (PNAs). Alternatively, the 
polynucleotides are used to detect and quantitate gene expression in samples in which 
expression of PSEQ or the polypeptides encoded by NSEQ are correlated with disease. 
Additionally, NSEQ or the polynucleotides encoding PSEQ can be used to detect genetic 
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polymorphisms associated with a disease. These polymorphisms may be detected at the 
transcript cDNA or genomic level. 

[0068] The specificity of the probe, whether it is made from a highly specific 

region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved 
motif and the stringency of the hybridization or amplification (maximal, high, 
intermediate, or low), will determine whether the probe identifies only naturally 
occurring sequences encoding PSEQ, allelic variants, or related sequences. 
[0069] Probes may also be used for the detection of related sequences, and should 

preferably have at least 50% sequence identity to any of the NSEQ or PSEQ-encoding 
sequences. 

[0070] Means for producing specific hybridization probes for DNAs encoding 

PSEQ include the cloning of NSEQ or polynucleotide sequences encoding PSEQ into 
vectors for the production of mRNA probes. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by means of 
the addition of the appropriate RNA polymerases and the appropriate labeled nucleotides. 
Hybridization probes may be labeled by a variety of reporter groups, for example, by 
radionuclides such as 32 P or 35 S , or by enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, by fluorescent labels and the 
like The polynucleotide sequences encoding PSEQ may be used in Southern or northern 
analysis, dot blot, or other membrane-based technologies; in PCR technologies^ in 
microarrays utilizing fluids or tissues from patients to detect altered PSEQ expression. 
Such qualitative or quantitative methods are well known in the art. 
[0071] NSEQ or the nucleotide sequences encoding PSEQ can be labeled by 

standard methods and added to a fluid or tissue sample from a patient under conditions 
suitable for the formation of hybridization complexes. After a suitable incubation period, 
the sample is washed and the signal is quantitated and compared with a standard value. 
If the amount of signal in the patient sample is significantly altered in comparison to the 
standard value then the presence of altered levels of nucleotide sequences of NSEQ and 
those encoding PSEQ in the sample indicates the presence of the associated disease. 
Such assays may also be used to evaluate the efficacy of a particular therapeutic 
treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an 
individual patient. 
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I00721 Once ,he presence of a disease is estabUsher, and a treatment protect,, ts 

Litii, fiybndizafion or ampiincanon assays can be repeated on a regular^* 
d etermine if .he .eve, of expression in the patient begins to app— . « 
ob served in ,be norma, subject. The resufis ob,ained from success assays may be used 
t „ show ,he efficacy of neafinen, over a penod ranging from severatdays — 
,0073, The po,ynuc,eofides may be used for the diagnos.s of a vanery o drseaaes 

Uated wi.fi ce„ profiferafion inCuding cancer sncb as adenocarcinoma, Centra, 

T adrena, g,and, b.adder, bone, bone marrow, brain, breast, cervtx, gaU Madder 
^ gli a gasfioin.es.ina, haC heart, Mdney, fiver, to* musc,e, ovary pancrea, 

Z^. — •*-> «"-* ski - spleen - testis - " 

uterus 

|00741 Anernativefy, me po,ynuc,eo«des may be user, as targets in a nncroarray. 

Th e Itcroarray can be used to monitor the expression ,eve, of ,arge numbers of genes 
simult aneous,y and to identic spfice variants, mutations, and ^ 
information may be used to determine gene funcfion, ,o understand me genefic basts o a 
He, to diagle a disease, and to deveiop and monitor me acfivities of rherapeuUc 

Ts, ,n ye« anodrer alternate, P o,ynuc,eo,ides may be used to general 

ybridizafion probes usefh, in mapping me naturafiy occmring genomic sequence. 

Ireacen, in ,m nybridtzafio. (PISH, may be corrected with odrer 
c _e mapping ,ecfini q ues and genefic map da.a. (See e.g., H^n* 
(,995) in Meyers, R.A. (ed.) Molecular Bio.ogy and Bio,ochno,ogy, VCH Pubfishers 

New York, NY, pp. 965-968). JK pn™vk 
[0076) in anchor embodiment antibodies which spectficaUy bmd PSEQ may be 

sed for ,he diagnosis of diseases characterized by .fie over-o,u„derexpress,on of PSEQ 
or pCypepfides encoded by NSEQ. A,.emafive,y, one may use compete dru 
JZ -ys in whteh neutrafizing an.ibod.es capab.e of bfitding PS Q or 
polyP epfides encoded by NSEQ specificafiy compete with a tea, compound for bmdmg 
l^pepfides. fit this manner, antibodies - be used to de.ee. the presence of any 

pepLe wfiiefi shares one or more antigenic detemfinams with PSEQ or dte po,ypep«d. 

encoded by NSEQ. Diagnostic assays for PSEQ or the po.ypepfides enco « by NSEQ 

tadud e methods which utifize fire an.ibody and a ,abe, ,o detec. PSEQ or dte 
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p0 ,ypeptided encoded by NSEQ in human body fluids or in extracts of ee 
variety of protocois for measnring PSEQ or .he polypeptides encoded by NSEQ, 
mending ELISAs, RIAs, and FACS, are weU known in me art and provide . bas.s for 
diagnosing aUered or abnormal levels of ,he expression of PSEQ or the po^eptides 
encoded by NSEQ. Normal or standard vah.es for PSEQ expression are established by 
combining body fluids or eel, extracts taken from norma, subjects, preferably human, 
with antibody to PSEQ or a polypeptide encoded by NSEQ under conditions smtable for 
complex formation The amount of standard complex formation may be quantitated by 
various methods, preferably by photometric means. Quantities of PSEQ or me 
polypeptides encoded by NSEQ expressed in subject, control, and disease samp.es from 
biopsied tissues are compared with the standard vah.es. Deviation between standard and 
subject values establishes the parameters for diagnosing or monitonng dtsease. 
,0077, m another aspect, the polynucleotides and polypeptides of the present 

invention can be employed for treatment or the monitoring of therapeutic treatments for 
cancers. The polynucleotides of NSEQ or those encoding PSEQ, or any fragment or 
complement thereof, may be used for therapeutic purpose, In one aspect, the 
complement of the polynucleotides of NSEQ or those encoding PSEQ may be used tn 
situations in which it would be desirable to b.ock the transcription or translation of the 
mRNA. 

,00781 Expression vectors derived from retroviruses, adenoviruses, or herpes or 

vaccinia viruses, or from various bacterial plasmids, may be used for delivery of 
nucleotide sequences to the targeted organ, tissue, or eel, population. Methods 
wel , known to those skilled in me an can be used to construe, vectors to express nucle.c 
acid sequences complementary to the polynucleotides encoding PSEQ. (See, e.g., 
Sambrook, supra; and Ausubel, supra.) 

,00791 Genes having polynucleotide sequences of NSEQ or those encodmg 

PSEQ can be turned off by transforming a cell or tissue with expression vec.ors winch 
express high levels of a polynucleotide, or fragment thereof, encoding PSEQ. Such 
constructs may be used to introduce un.ransla.able sense or antisense sequences m«o a 
ce,l Oligonucleotides derived from the transcription initiation site, e.g., between about 
positions -10 and + 10 from the star, site, are preferred. Similarly, inhibition can be 
achieved using triple helix base-pairing methodology. Triple helix pairing ,s useflfl 
because it causes inhibition of the ability of the double helix to open sufficiently for the 
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binding of polymerases, transcription factors, or regulatory molecule, Recen, 
therapeutic advances using triplex DNA have been described in the literature. (See, e.g., 
Gee JE et al. (1994) in Huber, B E. and B.I. Carr, Molecular and Immunologrc 
Approaches, Fu«ura Publishing Co., Mt. Kisco, NY, pp. 163-177.) Ribozymes, enzymatic 
RNA molecules, may also be used to catalyze the specific cleavage of RNA. 
,0080] RNA molecules may be modified to increase intracellular stabihty and 

half-life Possible modifications include, but are no, limited to, me addition of flankmg 
sequences a. the 5' and/or 3' ends of .he molecule, or the use of phosphorottuoate or T O- 
m e«hyl rather man phosphodiesterase linkages within the backbone of the molecule. Ttns 
concept is inherent in the production of PNAs and can be extended in a), of these 
molecules by the inclusion of nontraditional bases such as inosine, queosme, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenme, 
cyrtdine, guanine, thymine, and uridine which are no. as easily recognized by 
endogenous endonucleases. 

,0081] Many methods for introducing vectors into cells or tissues are avertable 

and equally amiable for use in vivo, in vi«ro, and ex vivo. For ex vivo therapy, vectors 
may be introduced into stem cells taken from the patient and clonally propagated for 
autologous transplant back into that same patient. Delivery by transfection, by liposome 
injections, or by polycationic amino polymers may be achieved using methods whrch are 
wel l known in the art. (See, e.g., Goldman, C.K. e. al. (1997) Nature Biotechnology 
15:462-466.) 

,0082] Further, an antagonist or antibody of a polypeptide of PSEQ or encoded 

by NSEQ may be administered to a subject to treat or prevent a cancer associated wrth 
increased expression or activity of PSEQ. An antibody which specifically btnds the 
polypeptide may be used directly as an antagonist or indirectly as a targeting or dehvery 
mechanism for bringing a pharmaceutical agen. to cells or tissue which express the the 
polypeptide. 

,0083] Antibodies to PSEQ or polypeptides encoded by NSEQ may also be 

generated using methods mat are well known in the art. Such antibodies may include 
but are no. bruited to, polyclonal, monoclonal, chimeric, and single chain antibod.es, Fab 
fragments, and fragments produced by a Fab expression library. Neulralizing antibod.es 
(i e those which inhibit dimer formation) are especially preferred for .herapeutic use. 
Monoclonal antibodies to PSEQ may be prepared using any technique which provmes 
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for the production of antibody molecules by continuous cell lines in culture. These 
include, but are not limited to, the hybridoma technique, the human B-cell hybridoma 
technique, and the EBV-hybridoma technique. In addition, techniques developed for the 
production of chimeric antibodies can be used. (See, for example, Molecular Biology 
and Biotechnology, R.A. Myers, ed., (1995)John Wiley & Sons, Inc., New York, NY). 
Alternatively, techniques described for the production of single chain antibodies may be 
employed. Antibody fragments which contain specific binding sites for PSEQ or the 
polypeptide sequences encoded by NSEQ may also be generated. 

[0084] Various immunoassays may be used for screening to identify antibodies 

having the desired specificity. Numerous protocols for competitive binding or 
immunoradiometric assays using either polyclonal or monoclonal antibodies with 
established specificities are well known in the art. 

[0085] Yet further, an agonist of a polypeptide of PSEQ or that encoded by 

NSEQ may be administered to a subject to treat or prevent a cancer associated with 
decreased expression or activity of the polypeptide. 

[0086] An additional aspect of the invention relates to the administration of a 

pharmaceutical or sterile composition, in conjunction with a pharmaceutical^ acceptable 
carrier, for any of the therapeutic effects discussed above. Such pharmaceutical 
compositions may consist of polypeptides of PSEQ or those encoded by NSEQ, 
antibodies to the polypeptides, and mimetics, agonists, antagonists, or inhibitors of the 
polypeptides. The compositions may be administered alone or in combination with at 
least one other agent, such as a stabilizing compound, which may be administered in any 
sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, 
buffered saline, dextrose, and water. The compositions may be administered to a patient 
alone, or in combination with other agents, drugs, or hormones. 

[0087] The pharmaceutical compositions utilized in this invention may be 

administered by any number of routes including, but not limited to, oral, intravenous, 
intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, 
subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 
[0088] In addition to the active ingredients, these pharmaceutical compositions 

may contain suitable pharmaceutically-acceptable carriers comprising excipients and 
auxiliaries which facilitate processing of the active compounds into preparations which 
can be used pharmaceutical^. Further details on techniques for formulation and 
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c a ' tVi*» latest edition of Remington's Pharmaceutical 
administration may be found in the latest edition 

Sciences (Maack Publishing Co., Easton, PA). 

ST *» -» - ,herapeutteaiiy effective dose can : 

"I etther in ceU cutiure assays, ,,. of neop.as.ic ceUs or in anima, - -n as 
I! L, rabbits, dogs, or piss. A» anima, mode, may also be used «o de,emune , 
Z ^pnaie concenrmtion range and route of adm— Such InformaUon ean then 
bensedtodeterminensefuldosesandroutesforadministtationinhumans 

„,:, A therapeuticatiy effective dose refers .0 ft* amount of acfive msr«nen 

, i L of PSEO or those encoded by NSEQ, or fragments thereof, 

determined hy atartdard pharmaceutica, procedures m ceU cuhure, or „ 

annuals, such as by ca,cu,a«in g .he E0 5O (*e dose merapeu„ca„y effeettve 

to populadou) or LD 50 (ure dose lema. to 50% of the popu,atio„) satisfies. 

, 0 „ 911 Any of .he therapeutic methods described above may be apphed to any 

IZc, in need of such merapy, inc.udhrg, for examp.e, mammaU such as dogs, cats, 

cows, horses, rabbits, monkeys, and most preferably, humans. 

100921 APP T>erIl.y, embodiments of the present inven.ion emp.oy various 

Lbodiments of me present invention a,so re,a.e to an apparatus for performmg ti.se 
operations This apparatus may he speciaUy construct for me reared purposes, or t. 
I trrgene«se computer se,ective,y activated or reconfigured hy a computer 
maybeagener p „„„,„„ T he orocesses presented herein are 

program and/or data structure stored m the computer. The proc P 
uoUnherently reiated ,o any par«icu,ar computer or other apparatus, 
ariot genelpurpose machines may he used w«h programs written in accordance wtih 
hT lmugs h i, or i, may he more convenient ,0 construe, a more speeta^ 
iparauts to perform .he required metitod steps. A partis strucmre for a vanety of 
these machines will appear ftom the description given below. 

l009 „ h, addition, embodiments of the present invention relate to compufc 

eada ,e medta or computer program products tha. urc.ude program <-»-"^ 
data (including data stmemms) for performing various eomputer-tmp.emented 
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P^moles of computer-readable media include, but are not limited to, 

a • raraurtsiTui random access memory (Kivivi;. n» 
such as read-only memory devices (ROM) and rando 

pI „cessmg unrts, or CPUs) ar P ^ ^ ^ 

606 (typically arandom access memory, ^ ^^controllers 

only memory, or ROM). CPU 602 may ta0 f«~ W- • ^ 

and .nicroprocessors sncn as ^ microproceS sors. 

"* ^ Sat ; o :: o 3 J I and instmcUons nni- 

typica „y a secondary s,ora g e medium soch as a hard d k. Hw PP ^ 

• j tinp mass storage device 605, may, m <xyy^y 

information retamed wrthm *e mass sto g ^ ^ 

to the CPU. interfaC e 610 that connects to one or more 

rnnQSl CPU 602 is also coupled to an interlace oiu in 

„ uch as such as video monitors, track balls, mice, keyboards, 
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devices such as, of course, other confer, FinaUy, CPU 602 op t ionaUy may he 
coupled ,0 an externa, device such as a database or a computer or te.ecot—ttons 
us tag an externa, connection as shown generally a, 6,2^ ™ - ° 
section, I, is con.enrp.ated that the CPU might receive informal ^ 
„ r nugh, outpu, tnformation ,o the network in the course of performin g .he method s«eps 
described herein. 

I00961 In one embodiment, the comparer system 600 is directly coupled to an 

Cecnophoresis detection instrument. Data from the e.ectrophoresia detection instrument 
„ provided via interface 6.2 for ana,ysis hy system 600. Alternately, the data or 
.races processed hy system 600 are provided from a data ahorse sour* t such . . 
database or other repository. Again, the images are provide* via tnterface 6.2. Once 
the computer system 600, a memory device such as primary storage 606 or mass , . orag 
SOS buffers or stores, a, .east temporary, me data or trace image, With th, a a, *e 
imag e ana.ysis apparatus 600 can perform various ana.ysis operations such as — 
To mis end, the processor may perform various operattons on the stored 

images or data. - 
Jn I. is unders.ood that this invention is no, .imited to the parttcuta 

meU.odo.ogy, protocols, and reagents described, as these may vary. It is a,so understood 
« me term.no.ogy used herein is for the purpose of describing particular emhodmre* 
on,y, and is no, intended to Hmit the scope of the present invention whtch «... be hmued 
only by the appended claims. 

,00981 It is .o be understood that the above description .a .mended to be 

mustrative and no, restrictive. Many embodiment will be apparent to those skilled ,n 
th e art upon reviewing the above description. The scope of the inventton should 
before, be determined no, wi,h reference ,o the above description, bu, should mstead 
b e determined with reference to the appended Cairns, a.ong with me W scope of 
equivalents to which such claims are entitled. 



Page 26 



